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ABSTRACT 


Applications of the general linear model in experimental 
design and analysis usually involve design matrices of less 
Pialewiieencetinimerank. This may present a problem in deter- 
mining what elements and functions of the parameter vector 
are estimable and what hypotheses are testable. This thesis 
discusses two methods of answering questions about estima- 
bility and testability, where the form of the design matrix 
determines the method to be used. The two methods, both of 
which can use computer routines, are: (1) direct mathematical 
computational approach, and (2) a modification of an analysis 
of variance routine, with a special case of this method using 
a modified ANOVA routine and solutions to systems of linear 
equations. Confounding of effects is developed mathematically 
in connection with determining estimable functions. Methods 
discussed in this thesis can be applied to the area of Army 


Meat and mvaluation. 
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1. INTRODUCTION 


The model of interest is the general linear model: 
y= Xbd+ e 
where y is an nxl vector of observations, 
aces nxp Matrix of knew values, 
bis a pxl vector of parameters, 
and @ is a vector of random error terms. 
Throughcut this thesis a capital letter will denote a matrix, 
with a prime or superscript of "-1" representing its transpose 
or inverse respectively. A lower case letter with a tilda (~) 
above it will denote a vector with a prime again representing 
its transpose. This model is used in regression analysis and 
design analysis / Ref. 10 /. The derivation of a least squares 
estimator for b involves minimizing the sum of squares of the 
difference between the observations and their expected values. 
This problem can be further reduced to the problem of solving 


the normal equations 


X'XH = XZ, a 
as 


in which b is the estimator of b. In the case where X is of 
full rank p, a unique solution to the normal equations exists. 


A solution to the normal equations may then be written in form 


> 


al 


b= (xx) X'y (2) 








- 


The solution b found under these conditions can be shown to 
be the best linear unbiased estimator (b.l.u.e.) of Db. Refer- 
ences 4, 9, and 10 contain proofs of these statements. 

If the model is not of full rank, then (xexy-t does not 

exist and a solution to the normal equations may be written 

in terms of a generalized inverse of X'X. A generalized inverse 
G of a matrix A is definedto be any matrix having the property 
that AGA =A. The linear model with a matrix X that is not of 
full column rank can arise in various ways. For example, the 
nature of the experiment may result in a design matrix X that 
is not of full column rank, or the experimenter may have had 
problems that caused the experiment not to be conducted in 
accordance with an original full rank design. In this case 

an analyst may wish to perform a "salvage operation" to gain 
as much information as possible from the data derived from 

the experiment. In any case, the analyst is faced with these 
questions: 

(1) What is the actual form of the experiment conducted 
(Great 1c) xX)? 

(2) What inferences can be made from the information 
bil ee (i.e., what effects (elements and functions 
of b) are estimable and what hypotheses are testable)? 

With the less than full rank model, the solution to the 

normal equations is not unique; rather, many solutions exist. 


Based on a generalized inverse G of X*X, a solution to (2) 


may be written in the form 


me = CX*y (3) 





Por each generalized inverse G, discussed in detail by Pringle 
and Rayner / Ref. 8 /, there is a solution b° given by (3), 
and conversely (i.e., all solutions can be expressed in the 
form of (3)). Searle / Ref. 10_/ states that the normal equa- 
tions are consistent and that the solutions to (1) are given 
by (3) if and only if G is a generalized inverse of X'X. 

This thesis develops and discusses several methods for an 
experimenter to determine the estimability and testability of 
linear functions of the parameter vector. First, a mathemat- 
ically straightforward solution to these problems will be 


outlined. Then, the use of an analysis of variance computer 


routine for implementation of this solution is discussed, 





II. DEVELOPMENT 


To answer questions about testability, several topics 
must be discussed, including the generalized inverse of X'X 


and estimability of a linear function of Db. 


A. GENERALIZED INVERSE MATRIX 


An introduction to the theory of generalized inverses is 
contained in Searle / Ref. 10 7. A detailed discussion is 
available in Pringle and Rayner / Ref. 8 7. Throughout this 
paper the symbol G will be used to represent a generalized 
inverse of X'X. As discussed in Reference 8, the matrix G 
has many alternate names such as "pseudo-inverse," "conditional 
inverse" and "g-inverse," which makes identical information 
available in the literature under several different names. 
Searle presents the following important properties of a gen- 
eralized inverse: 

Theorem 1. When G is a generalized inverse of the matrix 

X'X, then 
1. G*t is also a generalized inverse of X'X; 
2. XGX'*X = X; GX' is a generalized inverse of X; 
3. XGX* is invariant to G. 

Various methods for computing a generalized inverse are 
described in detail by Searle and by Pringle and Rayner. The 
properties of G stated in Theorem 1 are important for the 
derivations in this thesis, but the direct computation of G 


will not be required for reasons described in the next chapter. 
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B. EXPECTED VALUES AND THE MATRIX H 


Since X'X is, in general, not of full rank, and thus equa- 
tion (2) cannot be solved for a unique solution : = (X*X) 7x $ 
as in the full rank model, the normal equations for the less 
than full rank model are written as 

X*Xb° = X°Y 
in which b° denotes any one of the many solutions that exist. 
Letting G denote a generalized inverse matrix of X'X, then the 
corresponding solution is given by 


b° = GXty (4) 


The expected value of be is given by 
E(b°) = GX'E(}) = GX'Xb = HD; 


where H = GX*X. According to Searle / Ref. 10 /, the matrix 
H is unique although G is not. For this case, note that b° 


moan dabracsed estimator of Hb, Pawicm abana Of d. 
C. ESTIMABLE FUNCTIONS 


Searle / Ref. 10 / states that a linear function of para- 
meters, in this case of components of b, iewestimable 2f a¢ 
is identically equal to some linear function of the expected 
value of a the vector of observations. In other words, q*b 
is estimable if and only if Q'b = t'E(}) for some vector oo 


The vector t+! may not be unique. 


ae 

















There are several important properties of estimable 
functions that will be necessary in determining which hypoth- 
eses are testable (see Reference 10). 

1. Linear combinations of estimable functions are 
estimable. 

2. All estimable functions are linear combinations of 
Xd. The expected value of vy E(y), is equal to Xd. By 
Gefinition, if q'd is estimable, then q'd = t'E(¥) for some 
<a SO 

q'b = t'Xd. (5) 
The concept of estimability does not depend on the value b, 


so equation (5) must be true for all values of D. Therefore 
a! _ etx 


Ad 
for some vector t’. We thus arrive at the following important 


ewamac lem Zation: 


ad 


q'b is estimable if and only if q' = t*°X for some t'. 


2 omsbiycueanGe tO Whatever 


3. When a'b is estimable, a'b 
solution b° of the normal equations is used. This is true 


because, by the previous property, for some t’', 
q'b° = Fexbo = Bexexy 


and XGX' is invariant under selections of G (Theorem 1). Thus, 
q*b° tmimeeeiant under choices of G and hence to Dos when 


q'd is estimable. 


TZ 











4, The following theorem / Ref. 10_/ provides a procedure 
for checking the estimability of q’b. 
i@eesen 2, A given function q°b is estimable if 

amd som y  1t aq°H = a'. | 

Proof: If q'b is estimable, then the definition 
of estimability implies that g° = ¢'X for 
some t', and q'H = £'*XH = +'XGX °X = +X 
by Theorem 1. If G"H = q*, then 


o~ ~ & at of vd 
Se oO Ge, = 4°X for t' = g'GxX, 


Pe LE STABEE HYPOTHESIS 
A testable hypothesis is a hypothesis that can be expressed 
in terms of estimable functions. Assume that a null hypothesis 


takes the form 


ae d > = 0. (6) 


If the null hypothesis q'b = 0 is to be tested by the analyst, 
then q*b° will be part of the test statistic which will need 
to be invariant to b° (detailed proof contained in Ref. 10). 
As discussed earlier, q° 5° is invariant if q°b is estimable. 
Thus by applying Theorem 2, if q'H = q' then q'b is estimable 
and alae a' db ZQmlome testable hypothesis. Similarly, if 
q'H 7 a’, then q'b is not estimable, and therefore the hypothesis 
q'b is not a testable hypothesis. In the testing of a hypo- 
thesis, Searle / Ref. 10 / proves that if an analyst uses the 
standard analysis of variance procedure to "test" the hypothesis 
given by equation (6) when in fact a'b is not estimable, the 
actual hypothesis tested is 

IG: 








| a 
Hie 31 ¥ lelle 0 
The discussions concerning estimability of q'b and the 
testing of the hypothesis Hera" = 0 also can be applied in 


the form of Q'd for 
Meee for i=1,..., 6, 


where b is a pei vector, qs temexp, and @" is@asxp matrix. 
Thus Q'b is estimable if and only if Q°H = Q', and the hypo- 
thesis Ho: Q'd = 0 is testable if Q'b is estimable. 

The next chapter describes methods of computing H, and 
the use of the matrix H will be demonstrated in examples in 


Chapter IV. 
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Pee anos OF DETECTING THE MATRIX H 


Determination of the matrix H is useful in answering 
questions concerning estimable functions and testable hypoth- 
eses. As discussed in Chapter II, the function G°D is esti- 
mable if and only if q°H = q°, and the hypothesis Ho: q'b = 0 
is testable only if q°b 1S €stimable. Recalling that H = GX'xX 
witere Gis a generalized inverse of the matrix X'X, the pro- 
blem indirectly becomes one of computing the matrix G and 
then performing the matrix multiplication to obtain GX'X, 

For many designs encountered in practice, the matrices X, X', 
and G are of large dimensions. Two approaches for computing 
H will be discussed and demonstrated by examples. The first 
approach will be the straightforward mathematical approach; 
but the most practical method appears to be one utilizing an 
analysis of variance (ANOVA) computer program. The approach 
using ANOVA computer programs is practical because the analyst 
would presumably be using a program of that type to analyze 
test data anyway. As will be discussed, with a simple modifi- 
cation of program inputs, the matrix H can be computed using 


the ANOVA program. 


A. DIRECT MATHEMATICAL COMPUTATION OF MATRIX H 
Although the direct approach is demonstrated by a sample 
problem with a relatively small dimensional design matrix Ie 


this approach provides insights into determining estimability. 


i 








For ease of discussion, the sample problem will be presented 
as a step-by-step procedure. 


For this example, the model can be written in the form 


Yi, =m ta, + Cig renee ily 2 ad i= lie 2, 


In matrix form this becomes 


Yq1 il: af 0 er4 
Yi0 iit ale 0 m C15 
= Se 
Yay a. 2 2, 
ae 
Y22 ao G0 


1. Gompute X'X 
As described in the Chapter II, the transpose product 


of X is desired throughout the computations. 


1. i See i eo 4 2 2 
X*X = ii 0 0 1 1 O|j= 1 N: 
0 Oy elt It Jie) 0) ail ZO ee 

i > eOwe . 


Note Cia veuleceumank Of ATX 1s 2. 
2. Compute G 
several methods are available to describe the compu- 
tation of the generalized inverse of a matrix / Refs. 8 and 
10 7. The method in this example is described by Searle 


Taroom eme@he order of X'X is 3 and its rank is 2, so 


16 








the first step is to delete (3-2) rows and corresponding 
columns from X'X, to leave a sub-matrix of full rank 2 
called (X'X) . For this example the first row and column 


are deleted to yield 


2 0 
(X'X) = 
0 2 
S$ 0 
and (X'x)7? = 
nN 0 a 
2 


Thus G, a generalized inverse of X'X, is determined by 
replacing all elements of (X'X) by those of its inverse 
and putting in zero for all other elements of X'X. Applying 


this procedure yields 


oF 0 0 
G = O -s O 
02 0 Vas 


As a check to see if G is a generalized inverse of X'X, 


femermine if X'XGX*X = X*K: 


4H 2 2 O00 Db Be? Oe il ee 
MixGeaee— {| 2 2 0 Oo ¢ 0 Caer | © Oo yia2e 2 0 
emo. 2 O80 “s 2a 2 OmnO lve ec 0 2 
4 2 2 
22 Oe x. 
20 ee 


Ef 





3. Compute H 


ive=computation of HM is a direct matrix multiplication. 


o a ae ft, 1 2 He, ae) 
ocean Oo + 0 ime 40) i eo 
0 0 4 Ome Osea 1 i Ome 
| 20) al 
0 Sore Wey 2 2 OMEO) 6 
iO = 0 2 2 (0 | | a (2 
0 0 4 2 0 2 JL ea. 


4, Compute H Using Alternate Method 
Two equations are useful in demonstrating an alternate 
method for computing H: 
H = GX°X 


wo v A) 
Bao gGey, ior any y. 


5 is the cue 


AJ 
column of the design matrix X, then a solution b can be 


If the v used for computation is X where x 


Gompuced for each j. Partitioning the matrices X and H, the 


following result is noted: 


— 


lad 


7 x) =GX'X, 
p 


i low ~ 
' = t 
ee 6 on ) GX ( aa 


0 


O!' w 
1 : Yo 


re 


Be 


Aw e 
Pommmicmtiiescis the columns of the matrix X, zo Will be 


referred to as "pseudo-data." 
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To demonstrate this procedure of Senputine H, the 


previous example will be used. 


computing 
~ O 
b, = oes Pome. 1, “2 3, 
Vv oO rv 
— felyeu 
by GX xy 
O70 0 Le Ge ea ai 0 
= 10 $ 0 ii 0 6 peer |r 
0 0 4 OMe) <a i 1 
\2 
Similarly, 
0 
eo ! Pe 
by GX Xo = ih 
0 
and 
0 
Vv" O ~_ 
D = GX? = O 
3 *3 
1 
OO 0 
RO We oO ! WO 
Thus H = (b, bo 93 ele ee) 
eee) =: 


Of course H computed using this method is 


H determined in paragraph 3. 


in ieeme God 


The solution becomes one of 


(8) 


, as before.(9) 


the same as the 


of computing H is 


important because an ANOVA computer routine can be used to 


“wo 
solve for the columns of H, using pseudo-data in place of jy. 





iy 





B. USE OF ANOVA ROUTINE TO COMPUTE THE MATRIX H 


In section A two methods were discussed for computing H. 
The simplicity of the design matrix X used in the example 
permitted easy computations. A more realistic design matrix 
in practice would be of large dimensions that such hand compu- 
tations might become impractical. Computers provide a means 
of solving this problem. One approach on a computer would be 
a direct mathematical approach using available computer routines 
for computing G, then H. Three routines would be required - 
compute H: one routine to compute X'X; a second routine to 
compute the generalized inverse of the matrix X'X; and a third 
routine to compute the matrix product of GX°X to attain H. 
imeesemethod is straightforward but requires detailed user 
knowledge by the analyst. The limitations of this approach 
are based on the limitations of the computer routines used, 
which normally are in the form of dimension capacities. 

A second approach using an ANOVA computer routine may be 
more practical for an analyst, since an ANOVA routine will 
probably be used to analyze the data from the experiment any- 
way. The procedure to be used involves the use of pseudo- 
data as described in step 4 of section A. Pseudo-data is 
analyzed by the routine, with the results of interest being 
the regression coefficients as determined by solving the 


normal equations (equation (1)) under the full model hypothesis. 
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This procedure can be demonstrated by an example using 
the General-Linear Hypothesis routine, BMDO5V, of the BMD 
Biomedical Computer Programs / Ref. ele / Suppose that the 


model is 


-. = eles I : Het A i= : 
V4.5 m a C5 J OG C5 BOG! ly 5 


leet 


In matrix form this model can be written in the general form 





ag mmo od © 60 1 0 0 0 0 0 m ia 
Yi2 oro IO Oo | 06 0 OO O ay Cio 
Y13 Momo Onm oO od oO 8 1 0 0 O a5 e153 
zx a Gke)) 

Yo4 Moret oO. 0 0 0 0 1 0 06 om Con 
oo meomernor) 0 0 0 0 0 1 0 a 255 
Vn3 momo O 1 0 0 0 0 0 1 C5 P05 | 

eG oy 

aC. 5 

can 

aol 

eye) 

23 


Each column of the design matrix is entered as pseudo-data. 
The computer output lists estimates of coefficients corres- 
ponding to the full model hypothesis. This gives b5° for the 


ue column of pseudo-data. Using the smaller dimensional 


raat 





notation required by the BMDO5V routine causes the coefficient 
vector to be in a reduced form. This can be expanded to the 


~ 0 : : at 
tated DE by applying the “usual linear restrictions" on the 


parameters given by 


= 


a. = QO 


These linear restrictions are common restrictions placed on 
the parameters in ANOVA problems, since by using these 
restrictions, the "reduced" design matrix may become of full 
rank, which in turn aides in solving the normal equations. 
The matrix H can then be formed as previously discussed by 
taking 


alo, 
b 


= ! 
Ho= (Dy beet 


Les ). 


Using the design matrix from equation (10) and the BMDO5V rou- 


tine, this procedure was followed to determine: 


Ze 





Meet 2 1/3" 1/3 Wer /6m/6 1/6 1/6 1/6 1/6 

ee=172° © 0 eee Get /69 1/6 9/6 -1/6 21/6 
=e /2 0 0 Oe = -1/6 4176 1/76 1/6 
i 0 Ware 5e-1/3°-1/73 1/3 -1/6 -1/6 1/73 -1/6 -1/6 
0 Diem 2/2 -1/3 -1/6 91/3 =1/6 =1/6 1/3 -1/6 
f= 10 =O Wee 39-973 2/3 -1/6 -1/6 1/3 =1/6 -1/6 1/3 


© 


© 


© 


le 


Oo O 0 0 0 O 1/3 -1/6 -1/6 -1/3 1/6 1/6 
0 0 0 0 0 0 -1/6 1/3 -1/6 1/6 -1/3 1/6 
Oo 0 0 0 0 Om =1/6 21/6 (W/o 6 ore 


oO Bae 0 0 0 Ole 1/5 1/6 1/6 3 ely Cee 
Ome 0 0 0 OM) Cea 3 ay ce Sy 1G 


0 0 0 0 6) O 1/6 1/6 -1/3 -1/6 -1/6 1/3 ) 


momareneck; this Matrix H satisfies the requirement that 
X*XGX'*X = X*KH = KX, 

If the transpose product of the design matrix used as 
input for the BMDO5V routine is singular, then the BMDO5V 
routine will terminate with an error statement. In such a 
case, the design matrix must be modified so as to accomplish 
the required results. 

To outline the procedure in such a case, consider the 
model 


=mt a. af C; a yp (6 Lae 3 acd. aso 6 Clee ase arslole esas 


Yijk k jk jk 1k 


ace. + C. moe thea eee, | = 1, 2s kk = 1,2. 


13 ale a 
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=> 


: 


[ 





This model is a 23 factorial design, and for this example 


iene Will be observations in only four of the eight cells 


% replication). Suppose observations are numbered and 


placed as shown below: 





The cells containing numbers represent the locations of the 
observations. The transpose of the design matrix X and the 
barameter vector b for this model are shown in figures 1 and 

2. The first step in computing H will be to determine the form 


of the reduced design matrix 2, which would be required by 


the BMDO5V routine. Thus the form would be 


ToC a alee ee ee 

(ier bw: «0 
_ oO wee) Sle O > 0 | 

iOs Ome tel O BO? A 


This design matrix 2 will not run on BMDO5V because the product 
2'Z is singular. To reduce Z to a form acceptable to the 
computer routine, determine a set of linearly independent 
columns of Z and write these columns as the matrix A. For 


this example, the first four columns suffice: 
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ee =. ss C—O 
Seo 1 OO Co 4a oO oH. 6 o O81 oC Co o oo once & 


Beet oO OO rm OOO © OG A OC OC OC HO oO Oo oO Oo o Oo 6 aco 
fe eo eS re oS 1) Ory © OO OO: 6 Oa.o iol a olaome 


Meee Ge oh He OOOO Gor. O Ho O16 <.o oOo oma tono & 


ON ee 


» 


spose of Design Matrix for = Replication of 


Factorial Design. 


2 


I 


PAGURE 1. 


a 
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FIGURE 2. 


oe 
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aC 59 ) 


Parameter Vector hor 
Factorial Design. 
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Replication of a 2? 





J a ee 
i: t ©» O 
A = ; 
ion tO - 
\a Be 19) 1} 
t 
twee that 4 is of the form Z = (AY A). Since A is nonsingular, 


so is AtA, and the inverse of A'A exists. Let oe (a°A) 72 
and Hy, = Gy, (AIA) = (AA) ~*(a°A) = I, The remaining steps are 
to compute G,, the generalized inverse of Z'& and then to 
compute Ho. Observe that 

A' A's 


2°42 = (A i) = eee 
A‘ A'A 


Using tne fact that A’A Gp A'S = A'A, then Gy is of the form 


SP | | 

Gp 1 Gn A'tA ~ 

_ ) ean ete ee etttti‘“‘iéR CT be ete et 

H, = Gp 2°2 & 1G, rary 

i 

t 
G,A'A | GaAtA ri 
— 2 f[----=---- Oi Fae — 1 See = 

2 

Gah A , GyAts Al i 


where I is the 4x4 identity matrix. This H, is in a reduced 
form and thus must be expanded from its 8x8 dimension to the 


Bex /7 iH matrix. Additional columns can be added (i.e., rows 
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expanded) based on the fact that each of the remaining columns 
can be expressed as a linear combination of the matrix Hypo» 
where Hp» consists of the first four columns of Hy « The 

row expansion of Hoo Will produce the 8x27 dimensional matrix 
Hag « To expand the rows, recall that each column of the 
design matrix X, denoted by Xe EOr ai 55. wae? ecine toe 
expressed as a linear combination of the matrix A (the first 


four columns of X). This linear combination can be expressed 


as 
AV. = X., Poe Teel Ge os 4 5 27. 


Since A is nonsingular, a+ exists and 


v - AK. for j= 5, ..., 27. 


The same linear relationship exists between each column of 


1a 
Hog and Hoo: 


the corresponding column of pseudo-data from the design matrix 


Letting - be the j column of Hoy and > be 


Peraeure 1), h, can be computed as 


v., ois j=) On 9 Aeeees 27 in tis cexanpile. 
J 


Each column can be expanded (i.e., additional rows added) by 


applying the same restraints as before, in the form: 


a a.= por = a =?) acd. ’ = Lacd,. «4, = Q 
, 1 j j i k ; ijk 5 J 
ad: & = cd. = > cd. = = 
ia ijk F jk ae ike 2 ads, 0 (12) 
= e e = ac. e = (ee 
and Zadix 2203 j ? ij 





Using this procedure as described, the 27x27 H for the 


= ceplication of a 27 factorial experimental design was 


obtained. It is shown in Figures 3 and 4 where H = (Hy 
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IV. APPLICATIONS OF THE MATRIX H 


The discussion in Chapter II as to the ILi(iholorannsteters) Yer 
the matrix H in answering an experimenter's questions on 
estimability and testability can be applied to the three 
examples in the previous chapter. In examining these examples, 
the concept of confounding can be mathematically explained 
and illustrated. Confounding as defined by References 1, 2, 
3, 5, 6 and 7 is the designing or arrangement of an experi- 
ment in such a manner that certain effects cannot be distin- 
guished from other effects. The references previously cited 
discuss the methods for intentionally confounding certain 
effects, normally the higher order interaction terms, with 
other effects for fractional replications in several different 
kinds of experimental designs. As discussed in Chapter II, 
the linear combination of the parameters q°b is estimable if 
aga only if q°H = @',. As stated before, a hypothesis is 
testable if it consists of estimable functions; so if the 
hypothesis Hy? a'b = 0 is "tested" with the standard ANOVA app- 
each, and q°D is not estimable, the hypothesis actually tested 
tom the form ae : q°HD = 0. From the expression q’Hb the 
analyst can determine which effects are confounded in the 
design. This determination of confounding can be illustrated 


through examples. 


Tn the first example, the matrix H was determined to be 


D2 





Consider the hypothesis Ho: a, - ay = O which implies 


q* = ( i=). For’g'b to be estimable and thus 
Hj: q'b = 0 testable, the condition q'H = q' must be met. 
0 0 0 
ws rad 
Meco -1) |t 1 0} = (0 1 -1) = q? 
% © al 


Therefore, the hypothesis Ho: a, - ap = 0 is testable. Consi- 


der a different hypothesis, say Hota, 


O. Then gq? = (0 1 0). 


Checking for estimability: 


0 0 0 
G@tH=(0 1 0) {1 1 of =(1 1 1) 44°. 
ose. 


Therefore, the Hota, = 0 is not testable but the hypothesis 

fwd “ 
tested actually is of the form Ho: Qi ibe =" 10 Gis a m+ a, = 0. 
This result means that the ay effect is confounded with the 


mean t 
In the second example, H was given by equation (1). 
Checking the testability of the hypothesis Ho? a, - ay = 0 


where 


acco 1 -1 0 0 © 0 0 0 0 © 0), 


33 





q'H does equal q's; therefore, the hypothesis Ho: q'b = 0 is 
testable. Suppose that the hypothesis Ha? Cy - Cy = Og 
where q' ——oommom Oo ft -? 0 0 0 0 0 O O), is tested. 
In this case q°H 1s not equal to q°, and therefore the 
hypothesis is not testable. If the hypothesis HO: C) - Cy = 
is tested using standard ANOVA test statistics, the actual 


hypothesis tested would be: 


PAH = ( : a ae = 
4 8 elas 0 0 0 21 -2 0 $ -§ 0 $ -% 0) D= 
C) - Cy t 2aCy 4 - ZACy 5 + BAC5) - ZAC a5 = 


Confounding obviously is present in the form of the two way 
interaction terms ACi a The analyst may suspect that the 
interaction terms are not Significant, in which case they 


can be disregarded. In the present case, if 2 acy, = 0 
al 


for j = 1, 2 is assumed as part of the model, again Ho is 
bestable. 

The 4 replication of the 23 factorial design produces 
confounding of many effects which can be illustrated for the 
case of "testing" the hypothesis inh a, 7- ap = Opel Cn uans 


example q? is 1 x 27 and is of the form: 
q! = (0 1 -1 0 e e a Ohne 


ne 2 ° ve , ° 
It is easily verified that q'H # q; iijollyalovee Weyer (ef ' lo aus) Ieee 
estimable, and therefore the hypothesis H,: a, - a) = QO is 


not testable. By computing q'H», the form of the confounding 


can be obtained: 
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V. CONCLUSIONS AND RECOMMENDATIONS 


The matrix H, once computed, is valuable to the analyst 
for determining the estimable linear functions of the para- 
meter vector and for ascertaining which hypotheses are test- 
able. As discussed in the previous chapter, information 
concerning confounding can be gained through H. 

Of particular note is the fact that the computation of 
the matrix H is dependent on the design matrix X. Although 
an experiment may not have been conducted in accordance with 
original design, the design matrix X will be known. The 
methods of computing H discussed in this thesis have limit- 
ations. The direct mathematical approach is limited by the 
constraints of the computer routines used to compute X°X, the 
generalized inverse matrix G of X*X, and thus H. The second 
method using the analysis of variance routine with its 
dimensionality constraints requires knowledge in the reduc- 
tion in dimensionality of the design matrix and the row 
expansion of H, but this knowledge would be required to use 
the ANOVA routine anyway. The second method is constrained 
by the fact that the ANOVA routine will not run for a singu- 
lar "reduced" X'X; this constraint led to a modification of 
the second method. The limitations of the modified method 
also would be in the form of computer program constraints in 
computing matrix inverses, solving systems of linear equations 
to expand the columns, and expanding the rows using the 


linear restrictions (equation (12)) imposed by BMDOSV. 
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A special purpose computer program to develop H could 
be written. A desired computer routine would be one that 
would accept as input the reduced design matrix and a set 
of q's and produce as output the corresponding matrix H, 
and q°H's. Although such a computer program might have some 
of the limitations discussed previously, the program could 
answer the user's questions concerning estimability and 
testability. As discussed in the previous chapter, the user 
could gain information as to which effects were confounded, 
and this in turn would aid in making decisions as to the 


significance of this confounding in his experiment. 
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